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REMARKS 

I. Prosecution History and Explanation of Amendments. 

The application as filed contained 1 1 claims. In an official communication (Paper 
No. 10) dated October 3, 2002, claims 1-1 1 were subjected to a restriction requirement. In a 
responsive Amendment and Response to Restriction Requirement (Paper No. 13) filed on 
March 17, 2003, the Applicants added claims 12-13; and elected with traverse to prosecute 
the invention of Group I, claims 1, 2, 5, 6, 8, and 9, directed to a polynucleotide comprising 
the nucleotide sequence set forth in SEQ ID NO: 1 . At the time of issuance of the 
outstanding Office Action, claims 1, 2, 5, 6, 8, 9, and 11-13 were pending in the application. 
In an Office Action dated June 24, 2003, claims 1, 2, 5, 6, 8, 9, and 1 1-13 were rejected 
variously under 35 U.S.C. §101, §102, §103, and §112, first and second paragraphs. Claims 

I, 2, 5, 8, 9, and 11-13 have been amended and new claim 14 has been added. 

II. Explanation of amendments. 

The amendments to the specification and the claims and the response made herein 
adopt suggestions made by the Examiner in an effort to advance this application4oward 
allowance. The claim for priority finds support in the original filing papers of this 
application. Other amendments to the specification are intended solely to correct 
typographical errors. No new matter has been added. 

Claims 1, 2, 5, 8, 9, and 1 1-13 as amended find support throughout the application 
(see especially Examples 1-8), and adopt the Examiner's recommendations to overcome the 
rejections. New Claim 14 finds support in the application at page 6, line 23 to page 7, line 1. 
The Applicants do not intend by these or any other amendments to abandon the subject 
matter of any claim as originally presented, and reserve the right to pursue such subject 
matter in other applications, such as continuing applications and divisional applications. 

Fig. 7 as amended finds support throughout the application (see especially SEQ ID 
NOs: 1-2 and the amino acid alignment illustrated in Figs. 5-6). 
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III. The Patent Office's rejection of claims 9 and 11 under 35 U.S.C. §101 should be 
withdrawn. 

At page 3, paragraph 7 of the Office Action, the Patent Office rejected claims 9 and 
11 under 35 U.S.C. §101, alleging that the claims were "not differentiable from the protein as 
it exists in nature." The Examiner suggested that the terms "isolated and purified" may be 
inserted into claim 9. Claims 9 and 1 1 as amended now recite an "isolated collectin protein" 
in accordance with the Patent Office's suggestion. Because the bases for rejection of claim 9 
and its dependent claim 1 1 for relating to non-statutory subject matter is now moot, the 
rejection should be withdrawn. Likewise, this rejection should not be extended to new claim 
14 which depends from claim 11. 



IV. The Patent Office's rejection of claim 12 under 35 U.S.C. §112, first paragraph 
should be withdrawn. 

At page 4, paragraph 7 of the Office Action, the Patent Office rejected claim 1-12 
under 35 U.S.C. § 1 12, first paragraph, alleging that these claims were not enabled by the 
specification. The Patent Office alleged that the claims should recite starting with the 
complement to the coding strand (SEQ ID NO: 1) because strands that hybridize to the 
coding strand would not encode a collectin. 

,A 

Claim 12 as amended now recites starting with a " a probe which is complementary to 
the nucleotide sequence set out in SEQ ED NO: 1" in accordance with the Patent Office's 
suggestion. Because the basis for rejection of claim 12 for lack of enablement is now moot, 
the rejection should be withdrawn. 

V. The Patent Office's rejection of claims 5 and 11 under 35 U.S.C. §112, second 
paragraph, should be withdrawn. 

At page 4, paragraph 6 of the Office Action, the Patent Office rejected claims 5 and 
1 1 under 35 U.S.C. § 1 12, second paragraph, alleging that these claims were indefinite. As 
the basis for rejection, the Patent Office alleged that claim 5 was indefinite because: 

"the sequence of bases even where they hybridize to the 
noncoding complementary strand and as a sequence that 
encodes a protein with antiviral function, a Ca+2 carbohydrate 
recognition domain, a neck region, a collagen like region, and 
an N-terminal cysteine only define the function but not the 
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sequence of the polynucleotide that was claimed by a structure. 
Where claim 5 recites SEQ ID NO:l, the size of the noncoding 
complementary strand is undefined. The claim should indicate 
the SEQ ID NO of the encoded protein as well. The blocking 
agent recited in line 5 does not appear to be defined as to what 
is the agent in the claim nor in the specification and the present 
response fi-om applicant does not point to any particular pages 
and lines of the specification that define the blocking agent." 

In addition, the Patent Office alleged that claim 1 1 was indefinite because neither 
claim 8 nor claim 9, fi-om which claim 1 1 depends, 

"permit variation of deletion, substitution, and/or addition of 
one or more amino acids in the coUectin protein. Claim 1 1 is 
unclear as to how deletions, substitutions, and/or additions 
which are not permissible fi-om the language of current claims 8 
and 9 fi-om which claim 1 1 depends is narrowed by expanding 
the limits of claims 8 and 9 by the language of claim 11," 

The rejection of claims 5 and 1 1 under 35 U.S.C. §112, second paragraph for 
indefiniteness should be withdrawn because claim 5 recites that the polynucleotide comprises 
a nucleotide sequence which hybridizes to a non-coding strand complementary to that set out 
in SEQ ID NO: 1 . Hybridization conditions are a common and accepted way to define a 
genus of polynucleotides. Claim 5 is consistent with Example 9 of the PTO*s Training 
Materials for the Interim Written Description and Utility Guidelines. In Examples 1-4 of the 
specification the claimed polynucleotide is obtained by the steps comprising: 1) identifying 
molecules with highly conserved regions with the collectin set out in SEQ ID NO: 3 (see 
Example 1), preparing probes fi:*om the clones so obtained for screening (see Example 2), 
screening by hybridization of a cDNA library firom human liver for complementary 
polynucleotides (see Example 3), and determining the base sequences of the cDNA (see 
Example 4). It should be noted that the Applicant succeeded in Example 3 through 
screening-by-hybridization to isolate polynucleotides which hybridize with the probes 
prepared in Example 2. From this process, the specification teaches one of ordinary skill in 
the art how to isolate the polynucleotides of claim 5. Protein encoded by the polynucleotides 
of claim 5 have been show to have the same activities (e.g. anti-viral activity, immuno- 
enhancing activity) as known coUectins. Claim 5 is directed to a polynucleotide that has the 
definite sequence properties to hybridize under the specified conditions and therefore the 
rejection should be withdrawn. 
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In addition, the term "blocking agent" is a term familiar to one of ordinary skill in the 
art of polynucleotide hybridization. Sambrook et al. Molecular Cloning: A Laboratory 
Manual (2""* ed.. Cold Spring Harbor Laboratory: New York ,1989), p. 9.49, Table 9.1 lists a 
variety of blocking agents used that were used in hybridization experiments as far back as 
1989. While not limited by the definition given in the specification, applicants point out that 
the term "blocking agent" recited in line 5 is definite because it can be defined, for example 
as DIG buffer II in Example 3 at page 10, line 30 of the specification. Therefore claims 5 and 
1 1 , which specify a blocking agent, are definite. 

Finally, amended claim 11, depending fi-om claims 8 and 9 no longer recites 
"deletion, substitution and/or addition of one or more amino acids." Amended claim 1 1 now 
specifies an isolated collectin protein according to Claim 8 or 9, wherein the collectin protein 
consists of (1) a Ca^'^-dependent carbohydrate recognition domain (CRD), (2) a neck region, 
(3) a collagen-like regions, and (4) an N-terminal region containing cysteine. The rejection 
should now be withdrawn with respect to claims 5 and 1 1 because the metes and bounds of 
the claims can be clearly determined. Likewise, this rejection should not be extended to new 
claim 14 which depends fi-om claim 11. 

VI. The Patent Office's rejection of claims 1, 2, 6, 8, 9, and 11 under 35 U.S.C. 
§1 02(b) should be withdrawn. 

To clarify the record, at page 6, paragraph 1 of the Office Action, the Patent Office 
alleges that: "This application currently names joint inventors." However, the patent 
application as originally filed indicated that Dr. Nobutaka Wakamiya was and is the sole 
inventor. 

In addition, at page 6, paragraph 2 of the Office Action, the Patent Office rejected 
claims 8, 9, and 1 1 under 35 U.S.C. § 102(b), alleging that the claims were anticipated by 
Hoppe et al.. In the Office Action, the Patent Office stated that: 

"The Hoppe et al. reference disclosed collectins which, absent 
factual evidence to the contrary, are anticipatory of claims 8, 9, 
and 1 1 because they contain any one or more deletions, 
substitutions, and/or additions .... Even where claims 8 and 9 
recite a sequence, when the claims include via comprises and 
the deletion, substitution, and/or addition language, the actual 
sequence is not defined and therefore, the polypeptide can have 
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any sequence such as would have been the polypeptide 
described in the Hoppe et al. reference." (Office Action at page 
6.) 

Applicants respectfully traverse. 

The rejection based on Hoppe et al., should be withdrawn because claims 8, 9, and 11, 
as amended, no longer recite "comprising" or "deletion, substitution and/or addition of one or 
more amino acids." Amended claim 8 now specifies an isolated collectin protein consisting 
of the amino acid sequence set out in SEQ ID NO:2. Amended claim 9 now specifies an 
isolated collectin protein consisting of the amino acid sequence encoded by the nucleotide 
sequence set out in SEQ ID NO: 1 . As noted above, amended claim 1 1 now specifies an 
isolated collectin protein according to Claim 8 or 9, wherein the collectin protein consists of 
(1) a Ca^^-dependent carbohydrate recognition domain (CRD), (2) a neck region, (3) a 
collagen-like regions, and (4) an N-terminal region containing cysteine. 

The rejection should now be withdrawn with respect to claims 8, 9, and 1 1 because 
Hoppe et al. does not teach an isolated collectin protein consisting of the amino acid sequence 
set out in SEQ ID NO:2 (claims 8 or 1 1) or an isolated collectin protein consisting of the 
amino acid sequence encoded by the nucleotide sequence set out in SEQ ID NO: 1 (claims 9 
and 11). Likewise, this rejection should not^be extended to new claim 14 which depends ^ 
from claim 11, New claim 14 is duly supported by the amino acid sequences set out in SEQ 
ID NOS: 1 and 2 and the amino acid alignment illustrated in Figures 5 and 6. 

In addition, at page 7, paragraph 1 of the Office Action, the Patent Office rejected 
claims 5 and 13 under 35 U.S.C. § 102(b), alleging that these claims were also anticipated by 
Hoppe et al. In the Office Action, the Patent Office stated that the Hoppe et al. reference 
disclosed coUectins which, "absent factual evidence to the contrary, are anticipatory of claims 
5 and 13 because claim 5 contains no sequence recitation as the result of the hybridization 
parameters." (Office Action at page 6.) Applicants respectfiilly traverse. 

The rejection based on Hoppe et al. should be withdrawn because amended claim 5 
and claim 13, which depends thereon, require that the sequence hybridize to SEQ ED NO: 1 
under certain parameters and because the nucleotide sequence disclosed in Hoppe et al. 
would not hybridize under the conditions recited. It is well known in the art that the 
hybridization temperature is lowered by about 1.4°C for every 1% mismatch in the 
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hybridizing DNAs. (See DNA - Basics of Structure and Analysis available at 
http://www.ndsu.nodak.edu/instruct/rncclean/;lsc731/dna/dna6.htni (Oct. 8, 2003). The DNA 
sequence disclosed in Hoppe et al. that is closest to the sequence as claimed is the Human 
Mannan Binding Protein (MBPl) cited in Table 1 (Taylor et al. attached hereto as Appendix 
2). There is no significant similarity (See Blast 2 Sequences results attached hereto as 
Appendix 3) between SEQ ID NO: 1 in the claims and the DNA sequence of human MBPl 
(Taylor et al., 1989), cited as the 102(b) art in Hoppe et al. by the examiner. Because there is 
no significant similarity of amino acid sequences (31% identity), there would be no 
significant similarity of the nucleotide sequences, and there would be no hybridization of the 
two DNA sequences, under the hybridization conditions cited in the claims. SEQ ID NO: 1 in 
the instant application is 44 % GC (704/1595 bases) So in 5X SSC ((log M [Na""]) = log 
0.825) its approximate melting temperature would be: Eff Tm = 81.5 + 16.6(log M [Na^]) + 

0. 41(%G+C). Therefore, the Eff T„, = 81.5 + 16.6 (log 0.825) + 0.41(44)^C = 98.2^C. Thus, 
in a hybridization that is performed at 55*^C (43.2°C lower than the melting temperature; 
100% - (43.2°C/1.4'*C) = 69.2% homology) there would be no detectable hybridization of the 
31% homologous MBPl sequence because 31% is so far below the approximate melting 
temperature. Consequently, the rejection should now be withdrawn with respect to claims 5 
and 13. 

Further, at page 6, paragraph 5 of the Office Action, the Patent Office rejected claims 

1, 2, and 6 under 35 U.S.C. § 102(b), alleging that the claims were anticipated by GenBank 
Accession Number AB00263 1 . In the Office Action, the Patent Office stated that: 

"While the reference would appear to be 24 Jan 1 999, the 
record indicates a submission of 4 Apr 1997 which, absent 
factual evidence to the contrary, predates the instant application 
filing date and the foreign priority date." (Office Action at 
page 6.) 

Applicants respectfiiUy traverse. 

Submitted herewith as Appendix 4 is the Rule 132 Declaration of Nobutaka 
Wakamiya in which he relates that the GenBank reference was not laid open to the public 
until June 24, 1999. Li addition, submitted herewith as Appendix 5 is an email fi-om Yoshio 
Tateno, Ph.D at the Center for Information Biology and DNA Data Bank of Japan, the 
collaborator with GenBank to which the sequence was submitted. As Dr. Tateno relates the 
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sequence data with accession number AB002631 was not laid open to the public until June 
24, 1999. As a consequence, GenBank Accession Number AB002631 is not 102(b) prior art. 
Accordingly, the rejection on the basis of GenBank Accession Number AB002631 should be 
withdrawn. 

Finally, at page 7, paragraph 5 of the Office Action, the Patent Office rejected claims 
5, 12, and 13 under 35 U.S.C. § 102(b), alleging that the claims were anticipated by Kawai et 
al.. In the Office Action, the Patent Office stated that: 

"claim 5 is anticipated since the claim does not per se require 
the polynucleotide obtained to must have the sequence of SEQ 
ID NO: 1 but only that it had hybridized under the recited 
conditions." (Office Action at page 8.) 

Applicants respectfully traverse. 

The rejection based on Kawai et al., should be withdrawn because amended claims 12 
and 5, and claim 13 which depends thereon, now require that the sequence hybridize to SEQ 
ID NO: 1 under certain parameters and because the nucleotide sequence disclosed in Kawai 
et al. would not hybridize under the conditions recited. As mentioned above, it is well known 
in the art that the hybridization temperature is lowered by about 1 .4°C for every 1% 
mismatch in the hybridizmg DNAs. The DNA sequence disclosed in Kawai et al. that is 
closest to the sequence as claimed is the Bovine Mannan Binding Protein (MBP). There is no 
significant similarity (See Blast 2 Sequences results attached hereto as Appendix 6) between 
SEQ ED NO: 1 in the claims and the DNA sequence of Bovine MBP, cited as the 102(b) art in 
Kawai et al. by the examiner. Because there is no significant similarity of the amino acid 
sequences (33% identity), there would be no significant similarity of the nucleotide 
sequences and there would be no hybridization of the two DNA sequences, especially under 
the hybridization conditions cited in the claims. SEQ ID NO: 1 in the instant application is 44 
% GC (704/1595 bases). So in 5X SSC ((log M [Na^]) = log 0.825) its melting temperature 
would be: Eff T^ = 81.5 + 16.6(log M [Na^]) + 0.41(%G+C). Therefore, the Eff Tm = 81.5 
+ 16.6 (log 0.825) + 0.41(44)°C = 98.2°C. Thus, in a hybridization that is performed at 55°C 
(43.2''C lower than the melting temperature; 100% - (43.2°C/1.4°C) = 69.2% homology) 
there would be no detectable hybridization of the 33% homologous MBPl sequence because 
31% is so far below the approximate melting temperature. Consequently, the rejection 
should now be withdrawn with respect to claims 5, 12, and 13. 
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VII. The Patent Office's rejection of claims 1, 2, 5, 6, 8, 9, and 11-13 under 35 U.S.C. 
§103 should be withdrawn. 

At page 8, paragraph 3 of the Office Action, the Patent Office rejected claims 1, 2, 5, 
6, 8, 9, and 1 1-13 under 35 U.S.C. § 103(a), alleging that the claims were obvious over Kawai 
et al. in view of GenBank Accession Number AB0022631 cited as a secondary reference. 
Applicants respectfully traverse. 

The rejection based on the combination of Kawai et al. and GenBank Accession 
Number AB002263 1 , should be withdrawn because, as noted above, the GenBank Accession 
Number reference is not prior art on the basis of publication on the same day as applicant's 
filing date. 



Withdrawal of the rejections and allowance of all pending claims in the application 
are respectfiiUy requested in view of the foregoing amendments and remarks. Should the 
Examiner wish to discuss any issues of form or substance in order to expedite allowance of 
the pending application, she is invited to contact the undersigned agent at the number 
indicated below. 

If there are any additional fees due in connection with the filing of this response, 
please charge the fees to our Deposit Account No. 13-2855. If a fee is required for an 
extension of time under 37 C.F.R. §1.136 not accounted for above, such an extension is 
requested and the fee should also be charged to our Deposit Account. 

Dated: November 20, 2003 Respectfully submitted. 



CONCLUSION 




Mark H. Hopkins, Ph.D. 
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Structure and evolutionary origin of the gene encoding a human 
serum mannose-binding protein 

Maureen E. TAYLOR/J Paul M. BRICKELL,t Roger K. CRAIGt§ and John A. SUMMERFIELD*|1 
♦Department of Medicine, St. Mary's Hospital Medical School, Norfolk Place, London W2 INY, 

and tThe Medical Molecular Biology Unit, Biochemistry Department, University College and Middlesex School of Medicine, 
London WIP 6DB, U.K. 



The A^-terminal sequence of the major human serum mannose-binding protein (MBPI) was shown to be 
identical at all positions determined with the amino acid sequence predicted from a cDNA clone of a human 
liver MBP mRNA. An oligonucleotide corresponding, to part of the sequence of this cDNA clone was used 
to isolate a cosmid genomic clone containing a homologous gener The intron/exon structure of this gene was 
found to closely resemble that of the gene encoding a rat liver MBP (MBP A). The nucleotide sequence of 
the exons differed in several places from that of the human cDNA clone published by Ezekowitz, Day & 
Herman [(1988) J. Exp. Med. 167, 1034-1046]. The MBP molecule comprises a signal peptide, a cysteine- 
rich domain, a collagen-like domain, a 'neck' region and a carbohydrate-binding domain. Each domain is 
encoded by a separate exon. This genomic organization lends support to the hypothesis that thie gene arose 
during evolution by a process of exon shuffling. Several consensus sequences that may be invblveid in 
controlling the expression of human serum MBP have been identified in the promoter region of the gene. 
The consensus sequences are consistent with the suggestion that this mammalian serum lectin is regulated as 
an acute-phase protein synthesized by the liver. 



INTRODUCTION 

Proteins capable of binding mannose-terminated 
glycoproteins have been isolated from the liver and serum 
of rats [1-3], rabbits [4,5] and humans [6-9]. At least two 
mannose-binding proteins (MBPs) are present in human 
serum [8]. The major human serum MBP (MBPI) is a 
homopolymer of approx. 700 kDa composed of subunits 
of 32 kDa and is secreted by the liver [7]. MBPI has 
specificities for A^-acetylglucosamine, TV-acetylmannos- 
amine, glucose and fucose as well as mannose [8]. The 
other MBP (MBP2) is a homopolymer of approx. 
200 kDa composed of subunits of 28 kDa and is specific 
only for mannose and fucose [8]. Both lectins are Ca'*'^- 
dependent. The function of these MBPs is unknown, but 
they have a high affinity for yeast mannan [1-9] and thus 
might act as a part of the immune system, binding and 
inactivating^yeasts and other-micro-organisms bearing 
mannose oligosaccharides. Support for this hypothesis 
conies from the finding that serum MBPs from rat, 
rabbit and human can activate complement through the 
classical pathway [10]. 

Cloning and sequencing of two rat liver MBPs has 
shown that they are divided into several domains: a 
signal peptide, a shon cysteine-rich A^-terminal domain, 
a collagen-like domain, a *neck ' region and a C-terminal 
carbohydrate-recognition domain [11]. The primary 
structure of a human liver MBP has recently been 
determined and found to be similar to that of the rat 
MBPs [12]. Northern-blot hybridization revealed marked 



MBP mRNA accumulation in the liver of a trauma 
victim, suggesting that MBP may be an acute-phase 
rcactant [12]. The sequences of carbohydrate-recognition 
domains of MBPs are highly homologous to those 
found in other carbohydrate-binding proteins, including 
membrane-bound lectins such as the rat asialoglyco- 
protein receptor [13]. By using the conserved amino acid 
residues found in the carbohydrate-recognition domains 
of these proteins as a guide, it has been possible to 
identify putative carbohydrate- recognition domains in 
proteins such as pulmonary-surfactant apoprotein [14] 
and the core protein of cartilage proteoglycan [15]. 

The gene structure of one rat liver MBP (MBP A) has 
been determined and found to share structural features 
with the genes for non-fibrillar collagens as well as with 
the genes for other carbohydrate-binding proteins [16]. 
In the present paper we give the structure of the gene 
encoding a human MBP, which we suggest is. the gene 
encoding the major human serum MBP (MBPI). We 
show that the structure of this gene is very similar to that 
of the gene encoding rat liver MBP A. We haveT also 
identified consensus sequences in the promoter region of 
the human MBP gene that indicate that its expression is 
regulated as part of the acute-phase response, 

MATERIALS AND METHODS 
Protein sequence analysis 

The major human serum MBP (MBPI) was isolated 
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from pooled serum collected from patients undergoing 
venesection for haemochromatosis by methods previ- 
ously described [8]. Affinity chromatography w»th a 
monoclonal antibody against human serum MBFl 
(Mab37) was used to purifiy MBPl to homogeneity, as 
judged by SDS/polyacrylamide-gel electrophoresis. The 
A^-terminal amino acid sequence of this serum MBP 
sample was determined on an Applied Biosystems gas- 
phase sequencer by Dr. K. Drickamer (Columbia Uni- 
versity, New York, NY, U.S.A.). 

Isolation of clones 

A cosmid library of human genomic DNA was kindly 
provided by Dr. Dimitris Kioussis (National Institute for 
Medical Research, London, U.K.). This library was con- 
structed in the cosmid vector cos 202 by usmg partially 
digested Sau3M fragments of genomic ^^NAJrom the 
human acute-lymphocytic-leukaemia cell line HPB-ALL 
[171 Some 500000 colonies were screened for the presence 
of MBP-encoding inserts by the method of Grunstein & 
Hogness [18]. The probe used (P^^^e 4) was a 40^^^ 
oligonucleotide with the sequence 5 -^GGGA AC^ ^;;.A- 
CAGACGGCAGATGGGAGGTGGAAGCAAGGG- 
G-3' deri ved from the region of the cDNA clone^[12] cod- 
ing for the carbohydrate-binding domam of MBP (Fig. 3a, 
cxon 4, bases 645-607, lower strand). The oligonucleotide 
was labelled with [y-^:'^P] ATP to a specific radioactivity of 
10' c.p.m.///g by the use of T4 polynucleotide kinase 
ri9] Duplicate filters were prehybridized overnight at 
55 «C in a solution composed of 6 x SSC, 5 x Denhardt s 
solution, 0,5% SDS and 100 /*g of denatured ^emng 
testis DNA/ml and then hybridized for 8 h at 55 C in 
the same solution containing the labelled probe at 
10« cp.m./ml. [IxSSC is 0.15 M-NaCl/l5 mM-sodium 
citrate buffer, pH 7.0, and I x Denhardt's solution is 
0.02 % Ficoll / 0.02 % polyvinylpyrrolidone / 0.02 /o 
bovine serum albumin (Pentax fraction V),] The filters 
were washed at 65 for 30 min Jn 2xSSC/0.1% 
SDS and autoradiographed. Colonics^ corresponding to 
positive signals were purified by two rounds of screenmg 
at low density. 

Analysis of clones 

Four further oligonucleotide probes were used, each 
having a sequence found in the cDNA clone Probe 
1 was 31 bases long with the sequence 5'-dGGTG- 
TGAG AAAACTCAGGG AAGGTTAATCTC- 3' deri- 
ved from the 5' untranslated region of the cDNA clone 
(Fig. 3a, exon 1, bases 57-27, lower strand^ Pro^ 
31 bases long with the sequence 5 -dGOCCC 1 i i 
GTCCTGGTGACCCAGAAGGCCC-3' derived from 
the region of the cDNA clone encoding the collagen-like 
domain (Fig. 36, exon 2, bases 580-550, lower strand). 
Probe 3 was 29 bases long with the sequence 5 -dCC- 
ATTCCTGGGGGTGGCCACAGAGGCTGG-3' de- 
rived from the region of the cDNA clone encoding the 
carbohydrate-binding domain (Fig. 3^. exon 4, bases 
408-379 lower strand). Probe 5 was 28 bases long with 
the sequence 5'-dGGTCATGTCATTAGGGGAGCC- 
CTCTACC-3' derived from the 3' untranslated region of 
the cDNA clone (Fig, 3d, exon 4, bases 1065-1038, lower 
strand). The sequences of some probes differed slightly 
from the sequence in Fig. 3 since they were based on the 
reported sequence of the cDNA clone [12], which we 
believe contains errors (see below). Southern-blot 
analysis was used to identify the restriction fragments of 
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the cosmid insert that hybridized to the probes. Frag- 
ments containing sequence of interest were cloned into 
M13 mpl8 and M13 mpl9 and sequenced by the 
dideoxy method [20] with the use of Sequenase (United 
States Biochemical Corporation). The strategy for the 
nucleotide sequence analysis is shown in Fig. 2. 

RESULTS AND DISCUSSION 

Protein sequences of human liver and serum MBPs 

Our aim was to isolate the gene encoding the major 
human serum MBP (MBPl) by using sequence data 
from the human liver MBP cDNA clone sequenced by 
Ezekowitz et al. [12]. However, since both human and rat 
liver contain at least two MBPs [9,1 1], we needed to show 
that the human liver cDNA sequenced by Ezekowitz 
et al [12] encoded human serum MBP. The iV-terminal 
amino acid sequence of serum MBPl is shown m Fig. 1. 
At all 21 positions determined, this sequence matched 
perfectly the amino acid sequence predicted fronf} th^ 
nucleotide sequence of the human liver MBP cDNA 
clone reported by Ezekowitz et aL [12]. This strongly 
suggests but does not prove absolutely that the cDNA 
clone encodes the major human serum MBP (MBPl). 

Isolation of positive clones 

Screening of 500000 colonies with radiolabelled 
oligonucleotide probe 4 yielded one positive clone 
(cMBPl). This clone contained an insert of approx. 
30 kb. Southern-blot analysis with ohgonucleotide 
probes corresponding to sequences at the 5' and 3 ends 
of the cDNA clone (probes 1 and 4) showed that cMBPI 
contained the whole of the transcribed region of the gene 
together with approx. 9 kb of 5' flanking sequence and 
16 kb of 3' flanking sequence. Therefore no further 
screening was performed. A partial restriction map of the 
clone is shown in Fig. 2. '' 

Intron/exon structure of the^^gene 

The nucleotide sequence of the 5' flanking sequence, 
the exons and the intron/exon boundaries of the MBP 
gene* contained within cMBPl is shown in Fig. 3. 
Comparison of this nucleotide sequence with that ot the 
cDNA clone [12] (Fig. 4) shows that the protein-coding 
region of the gene is interrupted by three mtrons of 
approx. 600 bp, 1350 bp and 800 bp in size. The 
sequences at all of the intron/exon boundaries contain 
the usual splice donor and acceptor consensus sequences 
[21]. The predicted amino acid sequence of the protem 
encoded by this gene is shown in Fig. 5. 

The intron/exon structure and the sizes of tlie introns 
are almost identical with those of the rat MBP A gene 
[16]. Thus exon 1 encodes the signal peptide, a cysteine- 
rich domain and seven copies of the Gly-Xaa-Yaa motif 
typical of the triple-helix-forming region of collagen. At 
the site of the first intron the regular Gly-Xaa-Yaa 
pattern is interrupted by the sequence Gly-Gln-Gly. 
Exon 2 then encodes a further 12 Gly-Xaa-Yaa repeats. 
Both rat MBP A and rat MBP C have the same smgle 
Gly-Gln-Gly interruption in the Gly-Xaa-Yaa repeat 
structure. A regular Gly-Xaa-Yaa pattern encoded by 
two adjacent exons is typical of genes encoding other 
non-fibrillar collagens [22-25]. Exon 3 encodes a ^neck 
region and exon 4 encodes a carbohydrate- recognition 
domain. This is also the case for the rat MBP A gene [1 6]. 
Moreover, the carbohydrate-binding domain of the rat 
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ND-Thr-Val-Thr-ND-Glu-ND-Ala-Gln-Lys-Thr-ND-Pro-Ala-ND-Ile-ND-ND-ND-ND-Pro-Gly-Ile-Asn-Gly-Phe-Pro-Gly-Lys-Asp 
Fig. 1. /V-Terminal amino acid sequence of the major human serum mannose-binding protein (MBPI) 
Abbreviation: ND, not determined. 




2 



id) 



Fig. 2. Partial restriction map of cosmid genomic clone cMBPl containing the gene encoding human MBPI 

(fl) Insert of cosmid clone cMBPl. {b) Region of cMBPl sequenced, (r) Structure of MBPI gene, {d) Sequencing strategy. 
Abbreviations: Ba, BamH\ ; Bs, BstEU ; C, aa\\ E, EcoKU P, Pstl; R. Rsal\ S, Stu\\ X, Xbal\Xh, Xhol Coding regions arc 
indicated by black boxes. 5' and 3' untranslated regions arc indicated by open boxes. 



liver asialoglycoprotein receptor is also encoded by 
discrete exons separated from the exons encoding other 
functional domains [26], This is consistent with the 
hypothesis that evolutionary shuffling has allowed 
carbohydrate- recognition domains to become associated 
with different effector molecules [13]. 

The sequence of the exons was found to be different 
from the published cDNA sequence [12] in seyeYal places, 
as shown in Fig. 4. The 3' untranslated region of the 
published cDNA clone [12] contains two stretches of 23 
and 56 nucleotide residues that are absent from the 
genomic clone. In addition, nucleotide residues 975-1 128, 
which are present in exon 4 (Fig. 3d, nucleotide residues 
728-958) are absent from the 3' untranslated region of 
the cDNA clone. It is unlikely that this reflects the 
presence of an intron in the 3' untranslated region of the 
human MBP mRNA, siiice there are no consensus splice 
donor and acceptor sites at the appropriate positions in 
the genomic sequence (Fig. 3d), It is possible that these 
differences arose from reverse transcriptase errors in the 
construction of the cDN A clone or that they arose 
because the cDNA and genomic clones derive from two 
distinct genes. The remainder of the. sequence differences 
are changes, deletions or insertions of single base-pairs 
that, in some cases, yield differences in the amino acid 
sequences predicted from the two nucleotide sequences. 

The amino acid sequences predicted from our se- 
quence, from the published cDNA sequence [12] and 
from the two rat MBPs [11] are shown in Fig. 5. As 
indicated, the signal sequence predicted from the genomic 
clone has one extra amino acid residue (residue 7) and 
four amino acid residue changes when compared with 
that predicted from the cDNA clone. The signal sequence 
predicted from the genomic clone retains the typical 
hydrophobic structure expected in a secreted eukaryotic 



protein [27] and, in particular, has a serine residue 
at position 20, immediately preceding the A^-terminal 
glutamic acid residue of the mature protein (Fig, 5). This 
conforms with the general observation that the amino 
acid residue immediately preceding the site of cleavage 
by the signal peptidase has a small side chain [28]. There 
are three amino acid residue differences in the first part of 
the collagen-like domain in exon 1, and in exon 3 only six 
of the 22 amino acid residues are the same as those 
published. Finally, there are 16 amino acid residue 
differences in the carbohydrate-binding domain (exon 4). 
The amino acid residue changes increase the overall 
homology of the human MBP to the rat MBPs [1 1], 
especially in the carbohydrate-recognition domain. Thus 
in 21 positions a residue is changed to that found in rat 
MBP C and in 16 positions a residue is changed to that 
found in rat MBP A. In particular, Thr-187 predicted by 
cMBPl is one of the 18 invariant residues found in all 
mammalian carbohydrate-binding proteins [29] whereas 
Pro- 187 predicted by the cDNA clone is not. These 
results give values of 61 % for the homology of the 
human MBP to rat MBP C and of 52 % for the homology 
of the human MBP to rat MBP A. 

It is possible the sequence of the cDNA clone [12] 
differed from that of our genomic clone because the 
genomic clone derives from a second MBP gene. How- 
ever, there are two lines of evidence that suggest that the- 
differences arise because the sequence of the cDNA clor^e;; 
[12] was partially incorrect. Firstly, where there ..ar|. 
stretches of amino acid residue differences {^yVng,^^^ 
residues 107r-l23) this reflects an alteration of the re^^mgv 
frame caused by a single-base deletion/insertion rather;: 
than extensive nucleotide differences (Fig. 4, nud^iles;:: 
389-^0). Secondly, if the two clones were denved.frpm; 
different genes encoding closely related pro terns we Avould 
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expect to find extensive nucleotide differences in the 
third-base position of codons, even when the predicted 
amino acid sequences were identical. This is seen when 
the rat and human nucleotide sequences are compared, 
but not when the human cDNA and genomic sequences 
are compared. For these reasons we suggest that the 
human cDNA [12] and genomic clones are derived from 
a single human MBP gene. In view of the identity of 



their products' predicted TV-terminal sequence with the 
A^-terminal sequence of the major human serum MBP 
(MBPl) (Fig. 1), we suggest that they derive from the 
human serum MBP I gene. However, we cannot exclude 
the possibility that they represent two different MBP 
genes, in which case our data do not allow us to identify 
which clone, if either, represents the human serum MBPl 
gene. If the cDNA [12] and genomic clones are derived 



(a) 5' flanking sequence, axon 1 and portion of intron 1 

ctgcagtggagactgtctttgttttcaaagggaaacttggaggcttagacctatggggct -784 

aggctgctgaggtttcttagggggcaatagctggaagaaagctctga^GAACAAATGaaa -724 

ggttaatactgagaaatgggaggaggattcaaggcaagttttctaattgccagtggtttt -664 

tgactcacAGAACATggggaattcctgccagaaagtagagaggtatttagcactctgcca -604 

qqqccaacqtaqTAAGAAATTTCCAGaqaa^a1-go1-1-a^^r^r.grjr^^=.^^^totr-t-r\aaac -544 

accaaggggaagcaaactccagttaattctgggctgggttggtgactaaggttgaggttg -4 84 

atctgaggttgagaccttcctctttggatcaccagctttcagctcagggcctgccaatga -424 

gtaaatgatagttaacaggtcctggaggggaatcagctgcccagatacaaagatgggatt -364 

caggtggcagatggacccgaagaggacatggagagaaagaggaagctcctacagacacci: -3 04 

gggtttccactcattctcattccctaagctaacaggcataagccagctggcaatgcac^ -244 

TCCCATTTGyyqTcactgccaccgaaaqcatqtttataqti CTTCCAGCAGCAACGCCAGG -184 

tgtctaggcacagatgaacccctccttaggatccccactgctcatcatagtgcctacctt -124 

tgttaaagtactagtcacgcagtgtcacaaggaatgtttactttt£CMATccccagcta -64 

gaggccagggatgggtcatctatttcTATASAgcctgcacccagattgtaggacagaggg -4 

catGCTCGGTAAATATGTGTTCATTAACTGAGATTAACCTTCCCTGAGTTTTCTCACACC .57 

A AAGGTGAGGACCATGTCCCTGTTTCCATCACTCCCTCTCCTTCTCCTGAGTATGGTGGCA 117 
MetSerLeuPheProSerLeuProLeuLeuLeuLeuSerMetValAla 

GCGTCTTACTCAGAAACTGTGACCTGTGAGGATGCCCAAAAGACCTGCCCTGCAGTGATT 177 
AlaSerTyrSerGluThrValThrCysGluAspAlaGlnLysThrCysProAlaVallle 

GCCTGTAGCTCTCCAGGCATCAACGGCTTCCCAGGCAAAGATGGGCGTGATGGCACCAAG 2 37 
AlaCysSerSerProGlylleAsnGlyPheProGlyLysAspGlyArgAspGlyThrLys 

GGAGAAAAGGGGGAACCAGgtacgtgttgg " " 2 67 

GlyGluLysGlyGluProG 

(b) Exon 2 and flanking sequences 

aacaacaactactagcaaaacaaatgcagttaattttcactttgcaccctccctgcagca 60 

acctccacgtggcaactttatttcttaagttattgctctcaggtgcacaccatacagtta 120 

ttgagagcagtgctcagaaaggtcagtcctgggtcaaggtctcctttctcctgagaagga 180 

ttggca tcaaactcttgaagagagagcaagaaca tgagatattaag t cacattcctttgt 24 0 

cttGcaacagGCCAAGGGCTCAGAGGCTTACAGGGCCCCCCTGGAAAGTTGGGGCCTCCA 300 
lyGlnGlyLeuArgGlyLeuGlnGlyProProGlyLysLeuGlyProPro 

GGAAATCCAGGGCCTTCTGGGTCACCAGGACCAAAGGGCCAAAAAGGAGACCCTGGAAAA 360 
GlyAsnProGlyProSerGlySerProGlyProLysGlyGlnLysGlyAspProGlyLys 

AGTCCGGgtaaggacccc 378 
SerProA 
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(c) Exon 3 and flanking sequences 

tctagATGGTGATAGTAGCCTGGCTGCCTCAGAAAGAAAAGCTCTGCAAACAGAAATGGC 60 
spGlyAspSerSerLeuAlaAlaSerGluArgLysAlaLeuGlnThrGluMetAl 

ACGTATCATUVAAGTgtaagctttttctcttactctccaggcagcttgaagtttgggaaaa 120 
aArglleLysLysT 

atagatgcaacaaatatttgttgaatgcatataattttctgtaccctgctaggcatttct 180 

catattcttacctcatgaaattctcacaacattttggtagaaatggaggcaaagggaagt 240 

taaattacttgttcaaatgcacagagctaataaatggcaggggtggtttatagatggaag 300 

tcagtctgactcgagagaccctaatcctttaccgtctgatattgctcactgaaaatggga 360 

cttgagatttgggtggtcactggtattgagacctggccgtggggtctaacctgcctgggg 420 

caaatattttcaga 

{d) Exon 4 and flanking sequences 

accagtctgtctgttcacatttgggttgccatatttaacaaataaaaagacaaagcacca 60 

gttaaattggatttcagctgaaaaacatttttagcacaagtgtgtctgaaaaattatatg 120 

ggagatatactaaaatattcattgtttgtctgaaattcaaatttaactggacatactata 180 

ttttatccggcaactctactctagaagactttttcttgagaaataccttgagttgggctt 240 

aaqqatgaqtgagtttcacccactttttcacattttagGGCTGACCTTCTCTCTGGGCAA 300 

rpLeuThrPheSerLeuGlyLy 

ACAAGTTGGGAACAAGTTCTTCCTGACCAATGGTGAAATAATGACCTTTGAAAAAGTGAA 360 
sGlnValGlyAsnLysPhePheLeuThrAsnGlyGluIleHetThrPheGluLysValLy 



GGCCTTGTGTGTCAAGTTCCAGGCCTCTGTGGCCACCCCCAGGAATGCTGCAGAGAATGG 420 
sAlaLeuCysValLysPheGlnAlaSerValAlaThrProArgAsnAlaAlaGluAsnGl 

AGCCATTCAGAATCTCATCAAGGAGGAAGCCTTCCTGGGCATCACTGATGAGAAGACAGA 460 
yAlalleGlnAsnLeuIleLysGluGluAlaPheLeuGlylleThrAspGluLysThrGl 

AGGGCAGTTTGTGGATCTGACAGGAAATAGACTGACCTACACAAACTGGAACGAGGGTGA 540 

uGlyGlnPheValAspLeuThrGlyAsnArgLeuThrTyrThrAsnTr^ , /\ 

ACCCAACAATGCTGGTTCTGATGAAGATTGTGTATTGCTACTGAAAAATGGCCAGTGGAA 600 
uProAsnAsnAlaGlySerAspGluAspCysValLeuLeuLeuLysAsnGlyGlnTrpAs 



TGACGTCCCCTGCTCCACCTCCCATCTGGCCGTCTGTGAGTtCCCTATCTGAAGGGTCAT 660 
nAspValProCysSerThrSerHisLexiAlaVaiCysGluPheProIleEnd 

ATCACTCAGGCCCTCCTTGTCTTTTTACTGCAACCCACAGGCCCACAGTATGCTTGAAAA 720 

780 



ATGATCACTAACAGCACCAACAAAGCAATAATAGTAGTAGTAGTAGTTAGCAGCAGCAGT 840 

AGTAGTCATGCTAATTATATAATATTTTTAATATATACTATGAGGCCCTATCTTTTGCAT 900 

CCTACAATTAATTATCTAGTTTAATTAATCTGTAATGCTTTCG ATAGTGTTAACTTGCTG 960 

CAGTATGAAAATAAGACGGATTTATTTTTCCATTTACAACAAACACCTGTGCTCTGTTGA 1020 

GCCTTCCTTTCTGTTTGGGTAGAGGGCTCCCCTAATGACATCACCACAGTTTAATACCAC 1080 

AGCTTTTTACCAAGTTTCAGGTATTAAGAAAATCTATTTTTGTAACTTTCTCTATGAACT 1140 

CTGTTTTCTTTCTAATGAGATATTAAACCATGTAaagaacataaataacaaatctcaagc 1200 

aaacagcttcacaaattctcacacacatacatacctatatactcactttctaga 1254 
Fig. 3. Partial nucleotide sequence of human MBPl gene 

Sequences from introns and the 5' flanking region are shown in lower case. The consensus sequences in the 5' ^i^king region 
are shown in upper case and are underlined (see the text for details). The four exons are shown in upper case. The sequences 
from which the five oligonucleotide probes were derived are marked with a hne above them. 
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GGTCGGtAAATATGTGTTCATTAACTGAGATTAACCTTCCCTGAGTTTTCTCACACCAAGGTGAGGACCATGTCCCT 
GCTCGGTAAATATGT-TTCATTAACTGAGATTAACCTTCCCTGAGTTTTCTCACACCAAGG-GA 

154 

GTTTCGATCACTCCCTCTCCTTCTCCTGAGTATGGTGGCAGCGTCTTACTCAGAAACTGTGACCTGTGAGGATGCCC 
GTTTC-ATCACTCCCTCTC-TTCTCCTGAGTATGGTGGCAGCGTCTTAC-CAGAAACTGTCACCTGTGAGGATGCCC 

231 

AAAAGACCTGCCCTGCAGTGATTGCCTGTAGCTCTCCAGGCATCAACGGCTTCCCAGGCAAAGATGGGCGTGATGGC 
AAWIGACCTGCCCTGCAGTGATTGCCTGTAGCTCTCCAGGCATCAACGGCTTCCCAGGCAAAGATCGGCGTGAT^ 

^ 306 
ACCAAGGG-AGAAAAGGGGGAACCAdG-GCCAAGGGCTCAGAGGCTTACAGGGCCCCCCTGGAAAGTTGGGGCCTCC 
ACCAAGGGTAGAAAAGGGGGAAC-AGiST— CAAGGGCTCAGAGGCTTACAGGGCCCCCCTGGAAAGTTGGGGCCTCC 

2 . 3 383 
ITGGTGATA 
,!ITGGTGATA 

S^^5H^:^"^^'^^^^^^^^^^^^^^AAGCTCTGC^ 

GTAGCCC-AGGCTGCCTCAGAAAGAAAAGCTCTGCAAACAGAAATGGCACGTATCAAAA-G$-C-^ 

534 

TCTCTGGGCRAACAAGTTGGGAACAAGTTCTTCCTGACCAATGGTGAAATAATGACCTTTGAAAAAGTGAAGGCCM 
TCTCTGGGCAAACAAGTTGGGAACJUVGTTCTTCCTGACCAATGGTGAAATAATGACCTTTGJUWUUIGTGAAGGCC^ 

GTGTGTCAAGTTCCAGGCCTCTGTGGCCACCCCCAGGAATG-CTGCAGAGAATGGAGCCATTCAGAATCTCATCAAG 

gtgtgtcaagttccag-cctctgtggccacccccaggaatggctgcagagaatggagccattcagaatctStSa^ 

686 

GAGGAAGCCTTCCTGGGCAT-CACTGATGAGAAGACAGAAGGGCAGTTTGTGGATCTGACAGGAAATAGACTGACCT 
GAGGAAGCCTTCCTGGGCATGC-CTGATGAGAAGACAGAAGGGCAGTTTGTGGATCTGACAGGAAATAGACTGACC^ 

ACACAAACTGGAACGAGGGTGAACCCAACAATGCTGGTTCTGATGAAGATTGTGTATTGCTACTGAAAAATGGCC^ 
ACACAAACTGGAACGAGGGTGAACCCAACAATGCTGGTTCTGATGAACATTGTGTATTGCTACTGAAAAATGGC^^ 

TGGAATGACGTCCCCT-GeT-CCACCTCCCATCTGGCCGTCTGTGAGTTCCCTATCTGAAGGGTCAT ° 

TGGAATGAC-TCGCCTTGCTTCCACCTCCCATCTGCC-GTCTGTGAGTTCCCTATCTGAAGGG^^ 

886 

" ATCACTCA GXk:CCTCCTTGTCTrrTTACTGCA-ACGC-ACJVGGCCCACA6TATGC'rTGAA 

TATCTGAAGGGTCAT-A-tCACTCAGGCCCTCCTTGTCTTTTTACT-C-TACCACA---i6ccS 

963 

"•"uATAATGAT*" — — — —— — — —— — — — _ ^ ^ ^ . 

GCACCAACAAAGCAATAATAGTAGTAGTAGTAGTTAGCAGCAGCAGTAGTAGTCATGCTAATTATAT^ 
ATATATACTATGAGGCCCTATCTTTTGCATCCTACAATTAATTATCTAGTTTAATTAATCTGTAAT 

TGTTAACTTGC 'Jl CAGTATGA^-I 

AATATATCATTCTCAGATCAGTACCTGCAGATGAAAATATCAGATCA6TACCTGCAG-ATGAA-GA 

" ■ 1214 

TAAGACGGCATTTATTTTTCCATTTACAACAAACACCTGTG-T--GTTGAGCCTT-ACTTTCTGTTTGGGTAGA^^ 

12 90 

CTCCCCTAATGACATCACCACAGTTTAATACCACAGCTTTTT-ACCAAGTTTCAGGTATTAAGAAAATCTATTTW^ 
CTCCCCTAATGACATGACCACAGTTTAATACCACAGCTTTTTTACCAAGTTTCAG^^^ 

- ■ TAACTTTCTCTATGAACTCTGTTTTCTTTCTTUITGAGATA^ -1342 

TAACTTTCTCTATGAACTCTGTTTTCTTTCTAATGAGATATTAAACCATGTAAAAAAAAAAAAAAAAAA 
4. Comparison of the nucleotide sequences of the exons of cMBPJ (upper) and the hepatic cDNA clone [121 (lower) 

he cMBPI sequence is numbered. The three intron/exon boundaries ofcMBPl are shown as vertical lines and are numbered, 
he translation mitiation and termination codons are marked by horizontal lines. Gaps in the sequence are indicated by 
ypnens. See the text for details. 



n the same gene, we suggest that the sequence of our 
omic clone is the correct sequence because the pre- 
ed changes increase the overall homology of human 
P to rat MBP C and rat MBP A. as described above. 

tream regulatory sequences 

xamination of the 843 bp of sequence ob- 
ed upstream of the 5' end of the cDNA clone [12] 
wed the presence of several consensus sequences, 
zh are indicated in Fig. 3. TATAA and CAAT 



boxes [21] are found at -38 bp and -79 bp from 
the start of the cDNA clone respectively. There is a 
region similar to the Drosophila heat-shock pro- 
moter consensus sequence CTnGAAnnTTCnAG [30] at 
-592 bp (TAAGAAATTTCCAG). In Drosophila the 
presence of heat-shock consensus sequence in the 5' 
region of the heat-shock genes is necessary for heat- 
induced chromosome 'puffing' and the co-ordinate 
increase in transcription of the heat-shock gene products 
[30]. The heat-shock consensus element in the 5' region 
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20 

MBP MetSerLeuPheProSerLeuProLeuLeuLeuLeuSerMetValAlaAlaSerTyrSer - - - 
cDNA Cys IleThr - Ser _ - _ 

MBPC Thr - Phe CysValLeuThr - Val Ala - - - 

MBPA Leu Leu - LeuVal CysValVal - Val Ser GlySerGlu 

40 

MBP _ _ - GLUThrValThrCysGluAspAlaGlnLysThrCysProAlaVallleAlaCysSerSer 
cDNA - - - 

MBPC - - - Leu - Gly SerSer 

MBPA ThrCysGlu LeuLys ------ Ser - - 

MBP ProGlylleAsnGlyPheProGlyLysAspGlyArgAspGlyThrLysGlyGluLysGlyGluProGlly 
cDNA Arg GlyThr 

MBPC Leu His Ala 

MBPA ------- Arg Pro 

86 

MBP GlnGlyLeuArgGlyLeuGlnGlyProProGlyLysLeuGlyProProGlyAsnProGlyProSerGly 
cDNA 

MBPC Val Ala Pro AsnPro 

MBPA SerVal AlaPro 

X07 

MBP SerProGlyProLysGlyGlnLysGlyAspProGlyLysSer - - ProAbpGlyAspSerSerLeu 
CDNA - I Pro 

MBPC Lys Ala Thr Pro Arg - - ValGluPhe ThrThrAsn 

MBPA Gin Arg Asp ArgAlalleGluValLysLeuAlaAsn 

I 130 

MBP AlaAlaSerGluArgLysAlaLeuGlnThrGluMetAlaArglleLysLysTtpLeuThrPheSerLeu 
cDNA GlyCysLeuArgLys SerSerAlaAsnArgAsnGlyThrTyrGln ' 

MBPC IleAspLeu IleAla ArgSer LeuArgAlaMetArg TrpValLeuLeu Met 

MBPA MetGluAla IleAsnThr LysSerLysLeuGluLeuThrAsn LeuHisAla Met 

153 

MBP GlvLYsGlnValGlyAsnLysPhePheLeuThrAanGlyGluIleMetThrPheGluLysValLysAla 
CDNA 

MBPC SerGluAsh Ly3 Tyr MetSerSerValArgArg ProLeuAsnArgAla 

MBPA Lys Ser Lys Val His Arg Pro Ser 

176 

MBP Leu CyBV alLysPheGlnAlaSerValAlaThrProArQAsnAlaAlaGluAsnGlyAlalleGlnAsn 
cDNA ProLeuTrpProPro GlyMet 

MBPC SerGluLeu GlyThr Glu Arg 

MBPA SerGluLeuArgGlyThr lie Glu Lys Glu 

199 

MBP LeulleLvsGluGluAlaPheLeuGl y ll eThrAapG luLysThrGluGlvGlnPheValAgpLe uThr 
cDNA MetPro 

MBPC ValAla Aspval GlnArg AsnVal Glu 

MBPA ValAla ThrSer . Val MetTyrVal 

^ 222 

MBP GlvAsnArqLeuThrTvrThrAsnTr pA snGluGlyGl uProA anAsnAlaGl yS erAs pGluA s pCya 
cDNA His 
MBPC Val Arg ' Val Gly Asn 

MBPA Gly Ser LysLysAsp AspHis Gly Asp 

245 

MBP ValLeuLeuLeuLysAsnGl yG lnTrp ABnAflpV alPro Cye SerThrSerHisLeuAlaVal CyflGltt 
cDNA Ser PheHisLeuProSer 

MBPC Val Thr Lys Asp Phe Val 

MBPA ThrlleValAsp Leu IleSer GlnAla Thr 

248 

MBP PheProIle 
CDNA 

MBPC SerAsp 
MBPA Ala 

Fig. 5. Comparison of the amino acid sequences predicted from cMBPl, the human hepatic MBP cDNA clone |12] and the rat hepatic 
cDNA clones for MBP C and MBP A |1I| 

The amino acid sequence predicted from cMBPI is numbered. Residues identical with those predicted from cMBPi are indicated 
by clear spaces. Gaps in the sequence, introduced to optimize alignment, are indicated by hyphens. The intron/exon boundaries 
of MBPl are shown by vertical lines. The yV-terminai residue (Glu) of MBPl is in upper case. Invariant residues in the 
carbohydrate-binding domain are in bold type and underlined. See the text for details. 

of the MBP gene may have an analogous function and There are three regions of the 5' flanking sequenqe of 

lead to enhancernent of MBP transcription during the the MBP gene that are similar to consensus sequenc 

acute-phase response. It is noteworthy that heat-shock glucocorticoid-responsive elements. At --245 bp is a 

consensus sequences are present in the 5' flanking region sequence (GGTCCCATTTGTTGT) similar to the gluco- 

of the human C-reactive-prolein gene, whose product is corticoid-responsive element GGTAGAhnnTGTTCT 

also an acute-phase protein [31]. [32-34]. At - 656 bp is a sequence (A GA AC AT) identical 
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with another glucocorticoid-responsive element [35], and 
at —736 bp is a sequence (AGAACAAATG) similar to 
the glucocorticoid-responsive element AGAACAGATG 
[36]. Increased release of glucocorticoid hormones by the 
adrenals is an important part of the response to stress 
and results in elevated serum concentrations, principally 
of Cortisol. Thus the glucocorticoid-responsive elements 
in the promoter region of the MBP gene may also serve 
Lo increase MBP expression during the acute-phase 
response. 

The region from -204 bp to —184 bp has 90% 
homology with a part of the promoter region of the 
::loned serum amyloid A gene {SAAg9) [37], whose 
product is another acute-phase protein. The expression 
of the SAAg9 gene is increased by interleiikin I and 
■umour necrosis factor a [38]. The 5' region of the 
SAAg9 gene, which contains the elements responsive to 
!nterlcukin I, also contains the MBP-like 5' sequence, 
ilthough it is not known if this is the sequence that is 
responsive to interleukin 1 (P. Woo, personal communi- 
:ation). The presence of a highly homologous 20 bp 
jequence in the promoter regions of MBP and scrum 
imyloid A protein suggests that MBP synthesis may also 
DC induced by cytokines and is again consistent with 
M BP being an acute- phase protein. Regulatory sequences 
lave not yet been identified in the rat MBP A gene [16]. 

Serum arnyloid A protein and C-reactive protein are 
two major acute-phase proteins, synthesized by the liver, 
kvhich increase in concentration up to 1000-fold or more 
in response to tissue injury or inflammation. The details 
Df the regulation of expression of these genes is unclear, 
but their 5' regi oris contain heat-shock consensus ele- 
ments [31] and sequences responsive to cytokines [38]. 
The report by Ezekowitz ei aL [12] that hepatic MBP 
nRNA concentrations were low in a normal liver but 
wre greatly increased in a human liver that had been 
exposed to acute stress first suggested that MBP might 
ilso be an acute-phase protein. The findings in the 5' 
egion of the human MBP gene of a heat-shock consensus 
sequence, tliree glucocorticoid-responsive elements and 
I sequence with a high degree of homology to a similar 
sequence in the serum-am yloid-A-protein gene lends 
'urther support to the view that MBP is an acute- 
Dhase protein. Further work is required to establish 
vhich factors enhance MBP gene expression and which 
;equences mediate this effect. 

These data also pro y id e insight into the possible 
biological role of MBP. The signal peptide and collagen- 
ike domain of MBP as well as the evidence that it is an 
icute-phase protein all indicate that MBP is a hepatic 
lecretory protein. It seems unlikely that this MBP func- 
ions as an intracellular transport protein [39], As an 
icute- phase protein like serum amyloid A protein or C- 
eactive protein, serum MBP may be a part of the host 
lefences acting as a carbohydrate-specific immune sys- 
em. Potential ligands include bacteria, viruses and yeasts 
hat bear mannose-rich oligosaccharides. The binding of 
vlBP to an organism may then result in its destruction by 
ViBP-mediated activation of the complement cascade 
10]. 
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Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonorn5^**®*^lructure 

BLAST 2 SEQUENCES RESULTS VERSION BLASTN 2.2.6 [Apr-09-20031 

Match:|l Mis match:| -2 ga p open:j5 ! gap extension: j2 
x dropoff: jso expeGt:)lOOOO.( wordsize: )l1 : Filter F Mlijij 



Sequence 1 lcl|seq_l Length 1110 — 'ffiVLatC fifiaP c'htjA: 

Sequence 2 lcl|seq_2 Length 1595 — Se.Q /5 aJO- ' Fj^Tc^a^ OOi^ AfifC. 
No significant similarity was found 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi70 
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Blast 2 Sequences results 

Entrez BLAST OMIM Taxonomy Structure 



PubMed 

BLAST 2 SEQUENCES RESULTS VERSION BLASTP 2.2.6 [Apr-09-2003] 



( "•"' ■ "n i n i iiii r i iiii i i n ii n —i m iiiiiiiiiiiiii ii M|- | ""' i- iiii rrn iii ■ | i r ii M i 

^V^^^"^.®^ M gap open:|11 gap extension: |l 



x_dropoff: |50 expect:|lOOOO.| wordsize: |3 Filter F liiSl 



Sequence 1 lcl|seq_l Length 248 (1 .. 248) - -TAycoiC. Ml?^ 

Sequence 2 lcl|seq_2 Length 277 (1 .. 277) — /b ^6 . 2- f^/?*Ai qq/Z A/^^C. 



NOTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 
Score = 107 bits (268) , Expect = 2e-22 

Identities = 75/241 (31%), Positives = 112/241 (46%), Gaps = 29/241 (12%) 



Query: 3 3 

Sbjct: 34 

Query: 8 6 

Sbjct: 92 

Query: 132 

. .Sb j C t_: JL52 

Query: 187 

Sbjct: 212 

Query: 24 6 

Sbjct: 272 

CPU time: 

Lambda 
0.312 



PAVIACSSPGINGFPGKDGRDGTKGEKGEPGQ GLRGLQGPPGKLGPPGNPGPS 85 

P C++ 1+ PG G DG KG+ GE G+ G +G++G G +G GN G + 

PTAEVCATHTIS - -PGPKGDDGEKGDPGEEGKHGKVGRMGPKGIKGELGDMGDRGNIGKT 91 

GSPGPKGQKGDPG KSPDGDSSLAASERK ALQTEMARIKKWLTFSLG 131 

G G KG KG+ G KG RK L +AR+K + F 

GPIGKKGDKGEKGLLGIPGEKGKAGTVCDCGRYRKFVGQLDISIARLKTSMKFVKNVIAG 151 

- KQVGNKFFLTNGE IMTFEKVKALCVKFQAS VATPRNAAENGAIQNLI KEEAF LGI 186 

+ + KF+ E + + C +A P++ AN I + + + F +G+ 

IRE TEEK FYYIVQEEKNYRESLTHCRIRGGMIjAMPKDEAANTLIADYVAKSGFFRVFIGV _2 1 1, 

TDEKTEGQFVDLTGNRL-TYTNWNEGEPNNAGSDEDCVIiLLKNGQWNDVPCSTSHLAVCE 245 

D + EGQ++ L. Y+NWNEGEP++ EDCV +Ij +G+WND C + VCE 

NDLEREGQYMFTDNTPLQNYSNWNEGEPSDPYGHEDCVEMLSSGRWNDTECHLTMYFVCE 2 71 

F 246 
F 

F 272 



0.01 user sees. 



0 . 00 sys . sees 



0.01 total sees. 



K 



H 



0 . 134 



0.405 



Gapped 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi71 
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Liambda 
0.267 



K H 
0 .0410 



0 . 140 



Matrix: BL0SUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB : 939 

Number of Sequences : 0 

Number of extensions: 91 

Number of successful extensions: 30 

Number of sequences better than 10000.0: 1 

Number of HSP's better than 10000.0 without gapping: 1 

Number of HSP's successfully gapped in prelim test: 0 

Number of HSP's that attempted gapping in prelim test: 0 

Number of HSP's gapped (non-prelim): 1 

length of query: 248 

length of database: 497,145,297 

effective HSP length: 124 

effective length of query: 124 

effective length of database: 497,145,173 

effective search space: 61646001452 

effective search space used: 61646001452 

T: 9 

A: 40 

XI: 16 ( 7.2 bits) 
X2 : 129 (49.7 bits) 
X3 : 129 (49.7 bits) 
SI: 42 (21.8 bits) 
S2: 47 (22.7 bits) 



http://vvww.ncbi.nlm.nih. gov/blast/bl2seq/wblast2.cgi?l 
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PATENT 

Attorney Docket No. 19036/36615 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant: Nobutaka Wakamiya 
Serial No.: 09/600,932 
Filed: September 8, 2000 
For: Novel Collectin 
Group Art Unit: 1645 
Examiner: R. Mitra 



CERTIFICATE OF MAILING 

I hereby certify that this paper is being 
deposited with the U.S. Postal Service 
as First Class Mail in an envelope, 
postage prepaid, addressed to the 
Commissioner for Patents, P.O. Box 
1450, Alexandria, VA 22313-1450 on 
November ^2003. 



Mark H. Hopkins, Ph.D. 



DECLARATION OF NOBUTAKA WAKAMIYA 
PURSUANT TO 37 CFR § 1,132 

I, Nobutaka Wakamiya, a citizen of the Japan, residing at 1-4, Toko-Gojo 10-chome, 

Asahikav^a-shi, Hokkaido 078-8345, Japan, hereby declare that: 

/A ^ 

1. I am the sole inventor of the invention disclosed and claimed in the above- 
identified patent application, U.S. Patent Application Serial No. 09/600,932 ("the 
patent application"), originally filed on Se ptem ber 8, 2002, as a §371 of 
PCT/JP98/03328. As such, I am thoroughly familiar with the patent application as 
originally filed and amended. 



2. My qualifications and technical experience are set out in my curriculum vitae^ 
a copy of w^hich is attached as Appendix A. 



3. I have read and understand the official action from the U.S. Patent and 
Trademark Office (the "Patent Office") dated June 24, 2003 (the "Office action"), 
which was issued in connection with U.S. Patent Application Serial No. 09/600,932. 
I also have reviewed and understand the patents and publications cited by the 
examiner in the office action. I make this declaration to provide information known 
to me that may be relevant to various claim rejections in the office action, 

4. Various claims stand rejected over a GenBank Accession Number AB002631 
singly or in combination with the Kawai et al. publication for reasons set forth in the 
office action. The examiner stated that "While the reference would appear to be 24 
Jan 1999, the record indicates a submission of 4 Apr 1997 which, absent factual 
evidence to the contrary, predates the instant application filing and the foreign priority 
date. The reference discloses a polynucleotide that encodes a protein of SEQ ID 
NO:2 and is, absent factual evidence to the contrary, a polynucleotide with the 
sequence of SEQ ID NO:l. Thus, claims 1, 2, and 6 are anticipated." 

5. As indicated by the Examiner, the sequence data that appeared in the GenBank 
reference was actually submitted with GenBank on April 4, 1997 by the group of the 
present inventor. At the depositing of said sequence data, the present inventor 
simultaneously requested that GenBank not publish said sequence data until the patent 
application thereon was duly filed. Accordingly, the GenBank reference was 
published and available to the public until June 24, 1999, as is indicated by GenBank. 
This was one year after the filing date of the International PCT patent application 
PCT/JP98/03328, which was July 24, 1998. 



6. 



For these reasons, the GenBank reference cited by the examiner does not 



anticipate the claims of the patent application under 35 U.S.C. 102(b). Likewise, the 
GenBank reference cited by the examiner cannot properly be used as an obviousness 
reference under 35 U.S.C. 103. 

7. I further declare that all statements made herein of my own knowledge are true, 
that all statements made on information and belief are believed to be true, and further 
that these statements were made with the knowledge that willful false statements and 
the like so made are punishable by fine or imprisonment, or both, under 18 U.S.C. 
§ 1001 and that such willful false statements may jeopardize the validity of the 
application or document or any patent which may issue thereon. 




Date: November 14 , 2003 
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U.S. Patent Application No. 09/600,932 
(U.S. Phase of PCT/ JP/98/03328) 



CURRICULUM VITAE OF NOBUTAKA WAKAMIYA, M.D. 



[ Academic and Professional Backgroiind ] 

March 1980 Graduated from Hirosaki University, 

Faculty of Medicine, School of Medicine. 

July 1980 Joined to Osaka Prefectural Hospital 
as Resident Pediatrician. 

April 1982 Admitted to Doctor's Course in Department of 

Pathology (Virus III) at Graduate School of Medicine, 
Osaka University. 

March 1986 Graduated from the Doctor's Course above. 

Earned a degree of Doctor of Medicine. 

June 1986 Enrolled in Dana-Farber Cancer Institute, 
Harvard University as a Research Fellow. 

Jan. 1988 Joined to Research Institute for Microbial Diseases, 
Osaka University as an Assistant Professor. 



Nov. 2000 Joined Asahikawa Medical College as a Professor and 
chairman (the present position) . 



[ Specialized Field ] 

Biochemistry, Immunology and Microbiology 

[ Membership of Societies Belonged ] 

The Japanese Biochemical Society 
The Japanese Society for Virology 
The Japanese Society for Immunology 
American Society for Microbiology (ASM) 
American Association for Cancer Research (AACR) 

[ The Present Research Theme ] 

Biological Functions and Roles of Animal Lectin (Collectin) 
possessing Collagen Motifs. 

[ Recent Scientific Japanese P\iblications on Collectin ] 

(1) Wakamiya N. and Suzuki Y., "Collectin Families to be acted 
as Bio-Defensive Lectin", PROTEIN, NUCLEIC ACID AND ENZYME, 
Vol.45, No. 5, pp. 655-663 (2000). 

(2) Wakamiya N. and Suzuki Y., "Novel Hemangioendotheliocyte 
Scavenger Receptor CL-Pl", SEIKAGAKU, Vol.73, No. 3, 

pp. 205-208 (2001) . ^ ' 

[ Recent Scientific English Publications on Collectin ] 

1. Ohtani, K. , Suzuki, Y., Eda, S., Kawai, T., Kase, T., Yamazaki, 
H., Shimada, T., Keshi, H., Sakai, Y., Fukuoh, A., Sakamoto, 
T., Wakamiya, N . : Molecular cloning of a novel collectin from 
liver (CL-Ll) . J. Biol. Chem. 274(19): 13681-13689, 1999. 

2. Ohtani, K. , Suzuki, Y., Eda, S., Kawai, T., Kase, T., Keshi, 
H., Sakai, Y. , Fukuoh, A., Sakamoto, T., Itabe, H., Suzutani , 
T., Ogasawara, M., Yoshida, I., Wakamiya, N . : The membrane -type 
collectin CL-Pl is a scavenger receptor on vascular endothelial 
cells. J, Biol. Chem. 276(47): 44222-44228, 2001. 

** ** ** ★ ★ ★ ★ ** ** ** ** — 



Terre Rob 



From: / vA ddbjupclt@dclbj.nig.ac.jp 

Sent: ( ^ ^\ Friday, October 03. 2003 3:27 AM 

T : [ MOV 2 8 201B SI Terre Robertson 

^i>. ytateno@genes.nig.ac.jp; hsugawar@genes.nig.ac.jp; ddbjupdt@ddbj.nig.ac.jp 

Subject:^^ RE- Submission Publication Date Verification 

Dear Dr. Terre Robertson 

The sequence data with accession number AB002631 were released from the DNA Data 
Bank of Japan (DDBJ) on June 24 1999 in order to make them public. 

DDBJ is in collaboration with the EMBL Nucleotide Sequence Database in Europe 
and GenBank in USA to form and function as the International Nucleotide Sequence 
Databases . - . 

We take no responsibility for the priority and property issues for the submitted data. We 
simply inform you of the releasing date on request. 

We appreciate your understanding and cooperation. 



Sincerely yours, 

Yoshio Tateno, Ph.D. 

The Center for Information Biology and DNA Data Bank of Japan 
National Institute of Genetics 



We are not using "GI" and division is "HUM". 

As we told you before, please check AB002631 by using our retrival tool 
getentry: http://getentry.ddbj .nig.ac.jp/getstart-e.html 



>Date: Tue, 30 Sep 2003 08:54:01 -0500 

>From: "Terre Robertson" <TRobertson@marshallip . com> ,A 

>Dear Sir or Madam: You are correct that the acession no. is AB002631. 
>Version is AB002631.1 GI:5162874. Locus is AB002631 1594 bp mRNA 
> linear PRI-24 Jun 1999. Authors are Ohtani , Suzuki, Eda, Kawai, Kase, 
>Yama2aki, Keshi, sakai, Fukuoh, Sakamoto and Wakamiya. The Direct 
>Submission was made on 4 April 1997. Keywords are collectin 34. 
> 

>Medline- no . is- 99240-768- - - - — - - - — 

>PubMed No. ia 10224141 
> 

>Thank you so very much for your kind assistance with this matter. Your 
>help is deeply appreciated. 

> 

> Original Message 

>From: ddbjupdt®ddbj .nig.ac . jp [mailto : ddbjupdt®ddbj .nig.ac.jp] 
>Sent: Monday, September 29, 2003 8:31 PM 
>To: Terre Robertson 

>Cc : romiti@ncbi.nlm.nih.gov; ddbjupdt@ddbj.nig.ac.jp 
>Subject: Re: Submission Publication Date Verification 
> 
> 

>Dear Dr. Terre Robertson 
> 

>DNA Data Bank of Japan (DDBJ) has received your message at its update 
>email address by way of GenBank again. 

1 



I* » 

>As we explained in our previous mail, the accession number 
>of DDBJ/EMBL/GenBank is consisted of 

>* 1 alphabet character and 5 digits (ex. D12345) *** or **** 

>* 2 alphabet characters and 6 digits (ex. AB123456) . 

> 

>We think your request is about AB002631. 

>Please check to be sure that it is AB002631 by using our retrival tool 

>getentry : http : //getentry . ddb j . nig .ac.jp/ gets t art -e . html 

> 

>After having your confirmation, we will respond you about the data 

>as far as possbile. Please reply directly to us : ddbjupdt@ddbj .nig . ac . jp . 

>Thank you for your cooperation. 

> 

>Sincerely, 
>DDBJ update 

> 

>>Date: Mon, 22 Sep 2003 15:26:02 +0900 (JST) 
>>From: ddbjupdt@ddbj.nig.ac.jp 

>>Subject: Re: group: submission publication date verification 
>>To: TRobertson@marshallip.com 

>>Cc: romiti@ncbi.nlm.nih.gov, ddbjupdt@ddbj.nig.ac.jp 
> 

>>The accession number of DDBJ/EMBL/GenBank is consisted of 1 alphabet 
>>character 

>>and 5 digits (ex. D12345) or 2 alphabet characters and 6 digits (ex. 
>AB123456) . 

>>But your request is about "AB0022631". Please check and let us know 
>the acc 

>>again. We will respond you about the data as far as possbile. 

> 

>>Date: Mon, 29 Sep 2003 16:35:09 -0400 (EDT) 
>>From: Monica Romiti <romiti®ncbi . nlm . nih . gov> 
>>Subject: Submission Publication Date Verification 

> >To : ddb j updt@ddb j . nig .ac.jp 

> 

Begin Forwarded Message 

> 

>>Subject: Submission Publication Date Verification 
>>Date: Mon, 29 Sep 2003 15:25:39 -0500 
>>From: "Terre Robertson" <TRobertson@mar shall ip . com> 
>>To: <romiti®ncbi . nlm. nih . gov> 

=^ J\ 
> 

>>Dear Monica: Thank you for your email responses while I was out of 
>>town last week. On the printout from NCBU Sequence Viewer, there is no 

> 

>>D number. I received some email from the people at DDBJ stating that 
>>there would be a 6 digit number beginning with D- -unfortunately in my 
>>reviewing all my emails, that one got deleted. However, if you can 
>>re.contact them I would appreciate it . „ The aces s ion .no... is AB002 631. 
>>Version is AB002631.1 01:5162874. Locus is AB002631 1594 bp mRNA 
>>linear PRI-24 Jun 1999. Authors are Ohtani, Suzuki, Eda, Kawai, Kase, 
> 

>>Yamazaki, Keshi , sakai, Fukuoh, Sakamoto and Wakamiya . The Direct 
>>Submission was made on 4 April 1997. Keywords are collectin 34. 

> > 

>>Medline no. is 99240768 
>>PubMed No. ia 10224141 

>> 

>>This is all the information contained on the printout so . I do not know 
>>how to find the "D" number mentioned by the Japanese associates. Your 
>>help will be deeply appreciated. If you need me to fax the papers to 
>>you, please provide me with a fax number. Thanks so much for all your 
>>help . 
>> 

> > 

>>The material in this transmission contains confidential information 

2 



>>intended only for the addressee. If you are not the addressee, any 

>>disclosure or use of this information by you is strictly prohibited. 

>>If you have received this transmission in error, please delete it, 

>>destroy all copies, and notify Marshall, Gerstein & Borun LLP by 

>>telephone (312) 474-6300. Thank you. 

>> 

>> 

>> End Forwarded Message 

>> 
>> 

>>Best regards, 
>> 

>>Monica L. Romiti 

>>NCBI User Services 

>> 

> 

> 

> 

>The material in this transmission contains confidential information 
>intended only for the addressee. If you are not the addressee, any 
>disclosure or use of this information by you is strictly prohibited. 
>If you have received this transmission in error, please delete it, 
>destroy all copies, and notify Marshall, Gerstein & Borun LLP by 
>telephone (312) 474-6300. Thank you. 
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Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonom' 

BLAST 2 SEQUENCES RESULTS VERSION BLASTN 2.2.6 [Apr-09-2003] 




Match:|i; ; Mismatch:|-2 gap open:|5 1 gap extension: |2 

x_dropoff: |50 expect:jlOOOO.( wordsize: |11 i Filter T Aligri | 



Page 1 of 1 



Sequence 1 lcl|seq_l Length 1409 -'B<»fl**& (M0,ujf\U e.r aC) 

Sequence 2 lcl|seq_2 Length 1595 — /b ^JO - I OF fl0%^/ 'bCC/C 

No significant similarity was found 



/A 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi70 
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Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION BLASTP 2.2.6 [Apr-09-2003] 

1 I I I 

Matrix |BLOSUM62 Jr| gap open:|1 1 gap exten sion: j1 
x dropoff: |50 expect:|lOO0O.( wordsize: |3 j Filter T Align | 

Sequence 1 lcl|seqLl Length 277 (1 .. 277) - fb -i^Ot^l ^03fe / 

Sequence 2 lcl|seq_2 Length 249 (1 .. 249) (k/^tOAII &f aC) 




NOTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 
Score = 108 bits (270) , Expect = le-22 

Identities = 73/220 (33% ), Positives = 107/220 (48%), Ga ps = 15/220 (6%) 

Query : 56 GDPGEEGKHGKVGRMGPKGIKGELGDMGDRGNIGKTGPIGKKGDKGEKGLLGIPGEKGKA 115 

G PG G GK GR G KG KGE G +G G GP GK G +G G+ GIPG G+ 

Sbjct : 4 0 GPPGINGIPGKDGRDGAKGEKGEPG QGLRGSQGPPGKMGPQGTPGIPGIPGPIGQK 95 

Query: 116 GTVCD-CGRYRKFVGQLDISIARLKTSMKFVKN- -VIAGIRETEEKFYYIVQEEKNYRES 172 

G + G Y + A L++ + +KN + + + +K ++ ++ + E 

Sbjct: 96 GDPGENMGDYIRLATS ERATLQSELNQIKlSmLIFSLGKRVGKKAFFTNGKKMPFNEV 152 

Query: 173 LTHCRIRGGMLAMPKDEAANTLIADYVAKSGFFRVFIGVNDLEREGQYMFTDNTPIiQNYS 232 

TC G+AP+ N+DV+F +G+ D E EG+++ + Y 

Sbjct: 153 KTLCAQFQGRVATPMNAEENRALKDLjVTEEAF LGITDQETEGKFVDLTGKGV-TYQ 207 

Query: 233 NWNEGEPSDPYGHEDCVEMLSSGRWNDTECHLTMYFVCEF 272 

NWN+GEP++ E CV +LS G WND C + VCEF 

Sbjct: 208 NWNDGEPNNASPGEHCVTLLSDGTWNDIACSASFLTVCEF 247 



CPU time: 

Lambda 
0 .314 

Gapped 
Lambda 
0.267 



K 



0.01 user sees. 
H 



0.00 sys . sees 



0.01 total sees. 



0. 136 



K H 
0 . 0410 



0 .417 



0 . 140 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2. cgi?0 
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\ 

Matrix : BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB : 973 

Number of Sequences : 0 

Number of extensions: 109 

Number of successful extensions: 32 

Number of sequences better than 10000.0: 1 

Number of HSP ' s better than 10000.0 without gapping: 1 

Number of HSP's successfully gapped in prelim test: 0 

Number of HSP's that attempted gapping in prelim test: 0 

Number of HSP's gapped (non-prelim) : 5 

length of query: 24 9 

length of database: 498,883,957 

effective HSP length: 124 

effective I'ength of query: 125 

effective length of database: 498,883,833 

effective search space: 62360479125 

effective search space used: 62360479125 

T: 9 

A: 40 

XI: 16 { 7.3 bits) 
X2 : 129 (49 . 7 bits) 
X3 : 129 (49.7 bits) 
31: 42 (21.9 bits) 
S2: 47 (22.7 bits) 
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